RAW SEQUENCE LISTING

PATENT APPLICATION US/09/419,305



DATE: 02/15/2000 
TIME: 19:00:20 



INPUT SET: S34768.raw



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



1 SEQUENCE LISTING 
2 

3 (1) General Information: 
4 

5 (i) APPLICANT: MARUTA, Kazuhiko

6 KUBOTA, Michio 

7 SUGIMOTO, Toshiyuki

9 (ii) TITLE OF INVENTION: RECOMBINANT THERMOSTABLE ENZYME WHICH 

10 FORMS NON-REDUCING SACCHARIDE FROM REDUCING AMYLACEOUS 

11 SACCHARIDE 

12

13 (iii) NUMBER OF SEQUENCES: 19

14

15 (iv) CORRESPONDENCE ADDRESS:

16 (A) ADDRESSEE: Browdy and Neimark 

17 (B) STREET: 419 Seventh Street N.W. Ste. 300

18 (C) CITY: Washington 

19 (D) STATE: D.C.

20 (E) COUNTRY: U.S.A. 

21 (F) ZIP: 20004 
22 

23 (v) COMPUTER READABLE FORM:

24 (A) MEDIUM TYPE: Floppy disk

25 (B) COMPUTER: IBM PC compatible

26 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

27 (D) SOFTWARE: PatentIn Release #1.0, Version #1.30

28

29 (vi) CURRENT APPLICATION DATA:

30 (A) APPLICATION NUMBER: 09/419,305

31 (B) FILING DATE: 

32 (C) CLASSIFICATION: 
33 

34 (vii) PRIOR APPLICATION DATA:

35 (A) APPLICATION NUMBER: US/08/505,448 

36 (B) FILING DATE: 21-JUL-1994

37

38 (A) APPLICATION NUMBER: JP 190183/1994 

39 (B) FILING DATE: 21-JUL-1994
40 

41 (vii) PRIOR APPLICATION DATA: 

42 (A) APPLICATION NUMBER: JP 189706/1995 

43 (B) FILING DATE: 04-JUL-1995

44

45 (viii) ATTORNEY/AGENT INFORMATION: 

46 (A) NAME: Browdy, Roger L.

47 (B) REGISTRATION NUMBER: 25,618 

48 (C) REFERENCE/DOCKET NUMBER: MARUTA=3
49 

50 (ix) TELECOMMUNICATION INFORMATION:

51 (A) TELEPHONE: (202) 628-5197 

52 (B) TELEFAX: (202) 737-3528
53 

54 

55

56 (2) INFORMATION FOR SEQ ID NO:1:

57

58 (i) SEQUENCE CHARACTERISTICS: 

59 (A) LENGTH: 720 amino acids 
60 (B) TYPE: amino acid

61 (D) TOPOLOGY: linear 

62 

63 (ii) MOLECULE TYPE: protein

64

65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

66

67 Val Ile Ser Ala Thr Tyr Arg Leu Gln Leu Asn Lys Asn Phe Asn Phe

68 1 5 10 15 
69 

70 Gly Asp Val Ile Asp Asn Leu Trp Tyr Phe Lys Asp Leu Gly Val Ser

71 20 25 30
72 

73 His Leu Tyr Leu Ser Pro Val Leu Met Ala Ser Pro Gly Ser Asn His

74 35 40 45

75

76 Gly Tyr Asp Val Ile Asp His Ser Arg Ile Asn Asp Glu Leu Gly Gly

77 50 55 60 

78

79 Glu Lys Glu Tyr Arg Arg Leu Ile Glu Thr Ala His Thr Ile Gly Leu

80 65 70 75 80 
81 

82 Gly Ile Ile Gln Asp Ile Val Pro Asn His Met Ala Val Asn Ser Leu

83 85 90 95
84 

85 Asn Trp Arg Leu Met Asp Val Leu Lys Met Gly Lys Lys Ser Lys Tyr 

86 100 105 110 
87 

88 Tyr Thr Tyr Phe Asp Phe Phe Pro Glu Asp Asp Lys Ile Arg Leu Pro

89 115 120 125
90

91 Ile Leu Gly Glu Asp Leu Asp Thr Val Ile Ser Lys Gly Leu Leu Lys

92 130 135 140
93 

94 Ile Val Lys Asp Gly Asp Glu Tyr Phe Leu Glu Tyr Phe Lys Trp Lys

95 145 150 155 160 
96 

97 Leu Pro Leu Thr Glu Val Gly Asn Asp Ile Tyr Asp Thr Leu Gln Lys

98 165 170 175
99
100 Gln Asn Tyr Thr Leu Met Ser Trp Lys Asn Pro Pro Ser Tyr Arg Arg

101 180 185 190 
102 

103 Phe Phe Asp Val Asn Thr Leu Ile Gly Val Asn Val Glu Lys Asp His

104 195 200 205
105 

106 Val Phe Gln Glu Ser His Ser Lys Ile Leu Asp Leu Asp Val Asp Gly

107 210 215 220 
108 

109 Tyr Arg Ile Asp His Ile Asp Gly Leu Tyr Asp Pro Glu Lys Tyr Ile

110 225 230 235 240 
111 

112 Asn Asp Leu Arg Ser Ile Ile Lys Asn Lys Ile Ile Ile Val Glu Lys

113 245 250 255
114 

115 Ile Leu Gly Phe Gln Glu Glu Leu Lys Leu Asn Ser Asp Gly Thr Thr

116 260 265 270 
117 

118 Gly Tyr Asp Phe Leu Asn Tyr Ser Asn Leu Leu Phe Asn Phe Asn Gln

119 275 280 285
120

121 Glu Ile Met Asp Ser Ile Tyr Glu Asn Phe Thr Ala Glu Lys Ile Ser

122 290 295 300
123 

124 Ile Ser Glu Ser Ile Lys Lys Ile Lys Ala Gln Ile Ile Asp Glu Leu

125 305 310 315 320 
126 

127 Phe Ser Tyr Glu Val Lys Arg Leu Ala Ser Gln Leu Gly Ile Ser Tyr

128 325 330 335
129 

130 Asp Ile Leu Arg Asp Tyr Leu Ser Cys Ile Asp Val Tyr Arg Thr Tyr

131 340 345 350 
132 

133 Ala Asn Gln Ile Val Lys Glu Cys Asp Lys Thr Asn Glu Ile Glu Glu

134 355 360 365 
135 

136 

137 Ala Thr Lys Arg Asn Pro Glu Ala Tyr Thr Lys Leu Gln Gln Tyr Met

138 370 375 380 
139 

140 Pro Ala Val Tyr Ala Lys Ala Tyr Glu Asp Thr Phe Leu Phe Arg Tyr 

141 385 390 395 400

142

143 Asn Arg Leu Ile Ser Ile Asn Glu Val Gly Ser Asp Leu Arg Tyr Tyr

144 405 410 415
145 

146 Lys Ile Ser Pro Asp Gln Phe His Val Phe Asn Gln Lys Arg Arg Gly

147 420 425 430 
148 

149 Lys Ile Thr Leu Asn Ala Thr Ser Thr His Asp Thr Lys Phe Ser Glu

150 435 440 445 
151 

152 Asp Val Arg Met Lys Ile Ser Val Leu Ser Glu Phe Pro Glu Glu Trp
206 (i) SEQUENCE CHARACTERISTICS:

207 (A) LENGTH: 2160 base pairs 

208 (B) TYPE: nucleic acid 

209 (C) STRANDEDNESS: single 

210 (D) TOPOLOGY: linear 
211 

212 (ii) MOLECULE TYPE: cDNA 

213 

214 

215 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

216 

217 GTGATATCAG CAACCTACAG ATTACAGTTA AATAAGAATT TTAATTTTGG TGACGTAATC 60 
218 

219 GATAACCTAT GGTATTTTAA GGATTTAGGA GTTTCGCATC TCTACCTCTC TCCTGTCTTA 120 
220 

221 ATGGCTTCGC CAGGAAGTAA CCATGGGTAC GATGTAATAG ATCATTCAAG GATAAACGAT 180 
222 

223 GAACTTGGAG GAGAGAAAGA ATACAGGAGA TTAATAGAGA CAGCTCATAC TATTGGATTA 240
224 

225 GGTATTATAC AGGACATAGT ACCAAATCAC ATGGCTGTAA ATTCTCTAAA TTGGCGACTA 300 
226 

227 ATGGATGTAT TAAAAATGGG TAAAAAGAGT AAATATTATA CGTACTTTGA CTTTTTCCCA 360 
228 

229 GAAGATGATA AGATACGATT ACCCATATTA GGAGAAGATT TAGATACAGT GATAAGTAAA 420

230

231 GGTTTATTAA AGATAGTAAA AGATGGAGAT GAATATTTCC TAGAATATTT CAAATGGAAA 480

232 

233 CTTCCTCTAA CAGAGGTTGG AAATGATATA TACGACACTT TACAAAAACA GAATTATACC 540 
234 

235 CTAATGTCTT GGAAAAATCC TCCTAGCTAT AGACGATTCT TCGATGTTAA TACTTTAATA 600

236

237 GGAGTAAATG TCGAAAAAGA TCACGTATTT CAAGAGTCCC ATTCAAAGAT CTTAGATTTA 660

238 

239 GATGTTGATG GCTATAGAAT TGATCATATT GATGGATTAT ATGATCCTGA GAAATATATT 720 
240 

241 AATGACCTGA GGTCAATAAT TAAAAATAAA ATAATTATTG TAGAAAAAAT TCTGGGATTT 780 
242 

243 CAGGAGGAAT TAAAATTAAA TTCAGATGGA ACTACAGGAT ATGACTTCTT AAATTACTCC 840
244

245 AACTTACTGT TTAATTTTAA TCAAGAGATA ATGGACAGTA TATATGAGAA TTTCACAGCG 900
246

247 GAGAAAATAT CTATAAGTGA AAGTATAAAG AAAATAAAAG CGCAAATAAT TGATGAGCTA 960
248

249 TTTAGTTATG AAGTTAAAAG ATTAGCATCA CAACTAGGAA TTAGCTACGA TATATTGAGA 1020
250 

251 GATTACCTTT CTTGTATAGA TGTGTACAGA ACTTATGCTA ATCAGATTGT AAAAGAGTGT 1080 

252

253 GATAAGACCA ATGAGATAGA GGAAGCAACC AAAAGAAATC CAGAGGCTTA TACTAAATTA 1140 
254 

255 CAACAATATA TGCCAGCAGT ATACGCTAAA GCTTATGAAG ATACTTTCCT CTTTAGATAC 1200
256 

257 AATAGATTAA TATCCATAAA TGAGGTTGGA AGCGATTTAC GATATTATAA GATATCGCCT 1260 